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SAMIR KHULLER *, BALAJI RAGHAVACHARI t, AND NEAL YOUNG * 

Abstract. The MEG (minimum equivalent graph) problem is the following: "Given a directed 
graph, find a smallest subset of the edges that maintains all reachability relations between nodes." 
This problem is NP-hard; this paper gives an approximation algorithm achieving a performance 
guarantee of about 1.64 in polynomial time. The algorithm achieves a performance guarantee of 1.75 
in the time required for transitive closure. 

The heart of the MEG problem is the minimum SCSS (strongly connected spanning subgraph) 
problem — the MEG problem restricted to strongly connected digraphs. For the minimum SCSS 
problem, the paper gives a practical, nearly linear-time implementation achieving a performance 
guarantee of 1.75. 

The algorithm and its analysis are based on the simple idea of contracting long cycles. The 
analysis applies directly to 2- Exchange, a general "local improvement" algorithm, showing that its 
performance guarantee is 1.75. 
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1. Introduction. Connectivity is fundamental to the study of graphs and graph 
algorithms. Recently, many approximation algorithms for finding minimum subgraphs 



that meet given connectivity requirements have been developed [|, [Tl], [ll], 16, 
These results provide practical approximation algorithms for NP-hard network-design 
problems via an increased understanding of connectivity properties. 

Until now, the techniques developed have been applicable only to undirected 
graphs. We consider a basic network-design problem in directed graphs 12. 13 



which is as follows: given a digraph, find a smallest subset of the edges (forming a 
minimum equivalent graph (MEG)) that maintains all reachability relations of the 
original graph. 

When the MEG problem is restricted to strongly-connected graphs we call it the 
minimum SCSS (strongly connected spanning subgraph) problem. When the MEG 
problem is restricted to acyclic graphs we call it the acyclic MEG problem. The 
MEG problem reduces in linear time 0] to a single acyclic problem given by the so- 
called "strong component graph" , together with one minimum SCSS problem for each 
strong component (given by the subgraph induced by that component). Furthermore, 
approximating the MEG problem is linear-time equivalent to approximating both 
restricted versions. 

Moyles and Thompson |l8| observe this decomposition and give exponential-time 
algorithms for the restricted problems. Hsu gives a polynomial-time algorithm 
for the acyclic MEG problem. 

The related problem of finding a transitive reduction of a digraph — a smallest 
set of edges yielding the same reachability relations is studied by Aho, Garey and 
Ullman M. Transitive reduction differs from the MEG problem in that the edges 
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in the transitive reduction are not required to be in the original graph. However, 
the transitive reduction problem decomposes just like the MEG problem into acyclic 
and strongly connected instances. For any strongly connected instance, a transitive 
reduction is given by any Hamilton cycle through the vertices. For an acyclic instance, 
the transitive reduction is unique and, as Aho et al. observe, is equivalent to the MEG 
problem: it consists of those edges (u, v) for which there is no alternate path from u 
to v. In fact, Aho, Garey and Ullman show that the transitive reduction problem is 
equivalent to the transitive closure problem. Thus, the acyclic MEG problem reduces 
to transitive closure. 

The acyclic MEG problem can be solved in polynomial time, whereas the mini- 
mum SCSS problem is NP-hard Q. Consequently, this paper focuses on approxima- 
tion algorithms for the minimum SCSS problem. By the observations of the preceding 
paragraphs, the performance guarantees obtained for the minimum SCSS problem 
carry over to the general MEG problem with the overhead of solving a single instance 
of transitive closure. 

1.1. Our Results. Given a strongly connected graph, our basic algorithm finds 
as long a cycle as it can, contracts the cycle, and recurses. The contracted graph 
remains strongly connected. When the graph finally collapses into a single vertex, the 
algorithm returns the set of edges contracted during the course of the algorithm as 
the desired SCSS. 

The algorithm achieves a performance guarantee of any constant greater than 
7r 2 /6 w 1.645 in polynomial time. We give a nearly linear-time version that achieves 
a performance guarantee of 1.75. We give examples showing lower bounds on the 
performance guarantees of the algorithm. For the general algorithm, the lower bounds 
are slightly above 1.5. For the nearly linear-time version, the lower bound is 1.75, 
matching the upper bound. 

The performance guarantee analysis extends directly to a simple "local improve- 
ment" algorithm called 2-Exchange. 2-Exchange starts with the given digraph and 
performs the following local improvement step as long as it is applicable: find two 
edges in the current graph that can be replaced by one edge from the original graph, 
maintaining strong connectivity. Similar local-improvement algorithms are natural 
candidates for many optimization problems but often elude analysis. We prove that 
the performance guarantee of 2-Exchange is 1.75. 

A natural improvement to the cycle-contraction algorithm is to modify the algo- 
rithm to solve the problem optimally once the contracted graph has no cycles longer 
than a given length c. For instance, for c = 3, this modification improves the per- 
formance guarantee to tt 2 /6 — 1/36 « 1.617. We use SCSS C to denote the minimum 
SCSS problem restricted to digraphs with no cycle longer than c. The minimum SCSS2 
problem is trivial. The minimum SCSS3 problem can be solved in polynomial time, 
as shown by Khuller, Raghavachari and Young However, further improvement in 
this direction is limited: we show that the minimum SCSS5 problem is NP-hard. In 
fact, we show that the minimum SCSS17 problem is MAX SNP-hard. This precludes 
the possibility of a polynomial-time approximation scheme, assuming P^NP p|. 

1.2. Other Related Work. The union of any incoming branching and any 
outgoing branching from the same root yields an SCSS with at most 2n — 2 edges 
(where n is the number of vertices in the graph). This is a special case of the algorithm 
given by Frederickson and JaJa || that uses minimum weight branchings to achieve a 
performance guarantee of 2 for weighted graphs. Since any SCSS has at least n edges, 
this yields a performance guarantee of 2 for the SCSS problem. 
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Any minimal SCSS (one from which no edge can be deleted) has at most 2n — 2 
edges and also yields a performance guarantee of 2. The problem of efficiently finding 
a minimal SCSS is studied by Simon ]2l|| . Gibbons, Karp, Ramachandran, Soroker 
and Tarjan fl0|| give a parallel algorithm. 

A related problem in undirected graphs is to find a smallest subset of the edges 
forming a biconnected (respectively bridge-connected (i.e., 2-edge-connected)) span- 
ning subgraph of a given graph. These problems are NP-hard. Khuller and Vishkin [^5[ [ 
give a DFS-based algorithm that achieves a factor of | for biconnectivity and § for 
bridge-connectivity. Garg, Santosh and Singla |J subsequently improve the approx- 
imation factors, using a similar approach, to | and §, respectively. None of these 
methods appear to extend to the minimum SCSS problem. 

Undirected graphs having bounded cycle length have bounded tree width. Arn- 
borg, Lagergren and Seese || have shown that many NP-hard problems, including the 
minimum biconnected-spanning-subgraph problem, have polynomial-time algorithms 
when restricted to such graphs. 

2. Preliminaries. To contract a pair of vertices u, v of a digraph is to replace 
u and v (and each occurrence of u or v in any edge) by a single new vertex, and to 
delete any subsequent self-loops and multi-edges. Each edge in the resulting graph is 
identified with the corresponding edge in the original graph or, in the case of multi- 
edges, the single remaining edge is identified with any one of the corresponding edges 
in the original graph. To contract an edge (it, v) is to contract the pair of vertices u 
and v . To contract a set S of pairs of vertices in a graph G is to contract the pairs in 
S in arbitrary order. The contracted graph is denoted by G/S. Contracting an edge 
is also analogously extended to contracting a set of edges. 

Let OVT(G) be the minimum size of any subset of the edges that strongly con- 
nects G. In general, the term "cycle" refers only to simple cycles. 

3. Lower Bounds on OVT(G). We begin by showing that if a graph has no 
long cycles, then the size of any SCSS is large. 

Lemma 3.1 (Cycle Lemma) . For any directed graph G with n vertices, if a longest 
cycle of G has length C, then 

OVr{G)>^— x in-\). 

Proof. Starting with a minimum-size subset that strongly connects the graph, 
repeatedly contract cycles in the subset until no cycles are left. Observe that the 
maximum cycle length does not increase under contractions. Consequently, for each 
cycle contracted, the ratio of the number of edges contracted to the decrease in the 
number of vertices is at least . Since the total decrease in the number of vertices 
is n — 1, at least -^j( n ~ 1) edges are contracted. □ 

Note that the above lemma gives a lower bound which is existentially tight. For 
all values of C, there exist graphs for which the bound given by the lemma is equal to 
OVT(G). Also note that C has a trivial upper bound of n and, using this, we get a 
lower bound of n for CPT(G), which is the known trivial lower bound. 

Lemma 3.2 (Contraction Lemma). For any directed graph G and set of edges S, 

OTT(G) > OPT (G/S). 
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Proof. Any SCSS of G, contracted around S (treating the edges of S as pairs), is 
an SCSS of G/S. □ 



4. Cycle-Contraction Algorithm. The algorithm is the following. Fix k to 
be any positive integer. 

CONTRACT-CYCLESfe(G) 

1 for i= k,k-l,k-2,...,2 

2 while the graph contains a cycle with at least i edges 

3 Contract the edges on such a cycle. 

4 return the contracted edges 

In Section |(], we will show that the algorithm can be implemented to run in 
0{ma{m, n)) time for the case k = 3 and in polynomial time for any fixed value of fc. 
It is clear that the edge set returned by the algorithm strongly connects the graph. 
The following theorem establishes an upper bound on the number of edges returned 
by the algorithm. 

Theorem 4.1. CONTRACT-CYCLESfc(G) returns at most Ck ■ OVT(G) edges, 
where 

7T 2 7T 2 1 

<Cfc < 



6 " 6 (fc - l)fc 

Proof. Initially, let the graph have n vertices. Let n.; vertices remain in the 
contracted graph after contracting cycles with i or more edges (i = k,k — 1, 2). 

How many edges are returned? In contracting cycles with at least k edges, at most 
-£zj{n — Hfc) edges are contributed to the solution. For i < k, in contracting cycles 
with i edges, jzj{rii+i — ni) edges are contributed. The number of edges returned is 
thus at most 

k , , v-^ i , / 1 \ v-^ rij — 1 



-(n - n k ) + 7^1 _ "*) - ( 1 + ~k~l ) n + ^ 



- , i (*- i x*- 2 )' 

Clearly CPT(G) > n. For 2 < i < fc, when m vertices remain, no cycle has more 
than i — 1 edges. By Lemmas 3.1 and |3.2|, OVT(G) > jzik(ni — 1). Thus the number 



of edges returned, divided by OVT(G), is at most 
1 + tK) n A_ "-T 1 „, M -+- JL_ln * ^ 1 '■-} 



1 j7 k nj — 1 /-i i 1 \ k ru—l k— 1 1 

fr- 1 / ,y [ESSES < ~ j i y (ESSES = 1 y 1 

OVT(G) {^OPT(G)- n ^ 4=|(^ - 1) ^ z 2 

Using the identity (from |lj], p. 75]) ^ = \, we get 



C/c- 



7T Z 7T 1 ^ 1 

IT - Cfe = T + fc-1 ~ ^ i 2 

2 1 00 -1 

~ T + jfe - 1 ~ i(i + 1) 

7T 2 1 1 



6 fc-1 fc 

7T 2 1 

6 + (fc-l)fc' 



□ 

If desired, standard techniques can yield more accurate estimates of Cf., e.g., 
Ck = \ + + O (-p-) . If the graph initially has no cycle longer than £ (£ > k), then 
(as pointed out by an anonymous referee) the analysis can be generalized to show 
a performance guarantee of k 1 _~j s -i — h Yli^i V* ■ For instance, in a graph with no 
cycle longer than 5, the analysis bounds the performance guarantee (when k = 5) by 
1.464. 

Table gives lower and upper bounds on the performance guarantee of the algo- 
rithm for small values of k and in the limit as k — -> oo. The lower bounds are shown 
in the next subsection. 



k 


Upper Bound 


Lower Bound 


3 


1.750 


1.750 


4 


1.694 


1.666 


5 


1.674 


1.625 


oo 


1.645 


1.500 



Table 1 

Bounds on the performance guarantee 



4.1. Lower Bounds on the Performance Ratio. In this section we present 
lower bounds on the performance ratio of Contract-Cycles^ (G) . The graph in 
Fig. |l| has 2 ^_ 2 groups of vertices. Each group consists of a (2k — 2)-cycle "threaded" 
with a /c-cycle. 

In the first iteration, Contract-Cycles^G) can contract the fc-cycle within 
each group, leaving the graph with only 2-cycles. The algorithm subsequently must 
contract all the remaining edges. Thus, all the (3fc — 2) 2fc "_ 2 — 2 edges are in the 
returned SCSS. The graph contains a Hamilton cycle and the optimal solution is thus 
n. Hence, for arbitrarily large n, 1 + 2 k-2 ~ IS a l° wer bound on the performance 
guarantee of Contract-Cycles^ (G). As k approaches oo, the lower bound tends 
to 1.5. 




Fig. 1. Bad example for Contract-Cycles^C). 



5. 2-Exchange Algorithm. In this section, we use the cycle-contraction anal- 
ysis to show that 2-Exchange has a performance guarantee of 1.75. 2-Exchange is 
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a special case of /c-Exchange, which is defined as follows. 
/VExchange(G = (V, E)) — Local improvement algorithm 

1 E' <- E 

2 while the following improvement step is possible 

3 Pick a set Ej. of k edges in E 1 and a set -Efe-i of up to k — 1 edges in E 
such that the set of edges E" = (E' - E k ) U E k - X forms an SCSS. 

4 E' <- 

5 return i5' 

Note that for fixed fc, each step can be performed in polynomial time and it reduces 
the size of E' , so fc-ExCHANGE runs in polynomial time. The following theorem shows 
that the approximation factor achieved by 2-Exchange is 1.75. 

Theorem 5.1. The performance guarantee o/2-Exchange is 1.75. 

Proof. We will show that the edges output by 2-Exchange(G) could be output by 
Contract-Cycles 3 (G). Thus, the performance guarantee of 1.75 for Contract- Cycles 3 | 
carries over to 2-Exchange. 

First we show that the performance guarantee is at most 1.75. Let E' be the set of 
edges returned by 2-Exchange(G = (V, E)). Run Contract-Cycles 3 on the graph 
G' = (V,E'). Let H be the set of edges contracted during the first iteration when 
cycles of at least three edges are contracted. The resulting graph G' / H is strongly 
connected and has only 2-cycles. Such a graph has a tree-like structure. In particular, 
an edge (u,v) is present iff the reverse edge (v,u) is present. 

The important observation is that G/H is equivalent to G' /H. Clearly G' /H is 
a subgraph of G/H; to prove the converse, suppose that some edge (u,v) of G/H 
was not in G'/H. Consider adding edge (it, v) to G' /H. By the structure of G'/H, 
u and v are not adjacent in G'/H and for each edge on the path from v to u the 
reverse edge is also in G'/H. If (it, v) is added to G'/H, these (at least two) reverse 
edges can be deleted from G'/H without destroying the strong connectivity of G'/H. 
Consequently, the original edge in G corresponding to (u, v) can be added to G' and 
the original edges in G' corresponding to the reverse edges can be deleted from G' 
without destroying the strong connectivity of G' . This contradicts the fact that E' 
was output by 2-Exchange(G), since E' is eligible for an improvement step. 

Next consider executing Contract-Cycles 3 (G). Since G/H is equivalent to 
G'/H, the sequence of cycles chosen in the first iteration of Contract- Cycles 3 (G') 
could also be chosen by the first iteration of Contract-Cycles 3 (G). Similarly, the 
second iteration in Contract-Cycles 3 (G') could be mimicked by 
Contract-Cycles 3 (G), in which case Contract-Cycles 3 (G) would return the 
same edge set as Contract-Cycles 3 (G'). Since E' is minimal (otherwise an im- 
provement step applies), the edge set returned is exactly E' . Thus, the upper bound 
on the performance guarantee of Contract-Cycles 3 from Theorem 4.1 is inherited 
by 2-Exchange. □ 

For the lower bound on the performance guarantee, given the graph in Fig. ^|, 2- 
Exchange can choose a number of edges arbitrarily close to 1.75 times the minimum. 
There are j groups with 4 vertices in each group. First observe that the graph has a 
directed Hamilton cycle. The edges marked in Fig. form a solution that 2-Exchange 
could terminate with. This solution clearly has edges. This gives the lower bound 
of 1.75 on the performance of the algorithm. 

6. Implementation. For any fixed k, Contract-Cycles?; can be implemented! 

in polynomial time using exhaustive search to find long cycles. For instance, if a cycle 
of size at least k exists, one can be found in polynomial time as follows. For each 
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^ Edges returned by 2-Exchange 
— > 

Edges not used by 2-Exchange 
Fig. 2. Worst-case example for 2 -Exchange. 

simple path P of k — 1 edges, check whether a path from the head of P to the tail 
exists after P's internal vertices are removed from the graph. If k is even, there are 
at most m k / 2 such paths; if k is odd, the number is at most nm^ fe_1 ^ 2 . It takes 
0(m) time to decide if there is a path from the head of P to the tail of P. For the 
first iteration of the for loop, we may have 0(n) iterations of the while loop. Since 
the first iteration is the most time consuming, the algorithm can be implemented in 
0{nm 1+k / 2 ) time for even k and 0(n 2 mf- k+1 ^ 2 ) time for odd k. 

6.1. A practical implementation yielding 1.75. Next we give a practical, 
near linear-time implementation of C0NTRACT-CYCLES3. The performance guarantee 
achieved is C3 = 1.75. C0NTRACT-CYCLES3 consists of two phases: (1) repeatedly 
finding and contracting cycles of three or more edges (called long cycles), until no 
such cycles exist, and then (2) contracting the remaining 2-cycles. 

High-level description of the algorithm.. To perform Phase (1), the algorithm does 
a depth-first search (DFS) of the graph from an arbitrary root. During the search, 
the algorithm identifies edges for contraction by adding them to a set S. At any point 
in the search, G' denotes the subgraph of edges and vertices traversed so far. The 
rule for adding edges to S is as follows: when a new edge is traversed, if the new edge 
creates a long cycle in G' /S, the algorithm adds the edges of the cycle to S. The 
algorithm thus maintains that G' / S has no long cycles. When the DFS finishes, G' / S 
has only 2-cycles. The edges on these 2-cycles, together with S, are the desired SCSS. 

Because G' / S has no long cycles and the fact that the original graph is strongly 
connected, G' / S maintains a simple structure: 

Lemma 6.1. After the addition of any edge to G' and the possible contraction of a 
cycle by adding it to S: (i) The graph G' / S consists of an outward branching and some 
of its reverse edges, (ii) The only reverse edges that might not be present are those 
on the "active" path: from the super-vertex containing the root to the super-vertex in 
G' I S containing the current vertex of the DFS. 

Proof. Clearly the invariant is initially true. We show that each given step of 
the algorithm maintains the invariant. In each case, if u and w denote vertices in the 
graph, then let U and W denote the vertices in G' / S containing u and w, respectively. 

When the DFS traverses an edge (u,w) to visit a new vertex w:. Vertex w and 
edge (u, w) are added to G 1 . Vertex w becomes the current vertex. In G'/S, the 
outward branching is extended to the new vertex W by the addition of edge (U, W). 
No other edge is added, and no cycle is created. Thus, part (i) of the invariant is 
maintained. The super-vertex containing the current vertex is now W, and the new 
"active path" contains the old "active path" . Thus, part (ii) of the invariant is also 
maintained. 
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When the DFS traverses an edge (u, w) and w is already visited:. If U = W or 
the edge (U, W) already exists in G'/S, then no cycle is created, G' / S is unchanged, 
and the invariant is clearly maintained. Otherwise, the edge (u, w) is added to G' and 
a cycle with the simple structure illustrated in Fig. || is created in G'/S. The cycle 
consists of the edge (U,W), followed by the (possibly empty) path of reverse edges 
from W to the lowest-common-ancestor (lea) of U and W, followed by the (possibly 
empty) path of branching edges from lca(t/, W) to U. Addition of (U, W) to G'/S and 
contraction of this cycle (in case it is a long cycle) maintains part (i) of the invariant. 
If the "active path" is changed, it is only because part of it is contracted, so part (ii) 
of the invariant is maintained. 

When the DFS finishes visiting a vertex w:. No edge is added and no cycle is 
contracted, so part (i) is clearly maintained. Let u be the new current vertex, i.e., u>'s 
parent in the DFS tree. If U = W, then part (ii) is clearly maintained. Otherwise, 
consider the set D of descendants of w in the DFS tree. Since the original graph is 
strongly connected, some edge (x, y) in the original graph goes from the set D to its 
complement V — D. All vertices in D have been visited, so (x,y) is in G' . By part 
(i) of the invariant, the vertex in G'/S containing x must be W, while the vertex in 
G'/S containing y must be U. Otherwise the edge corresponding to (x, y) in G'/S 
would create a long cycle. □ 

The algorithm maintains the contracted graph G'/S using a union-find data struc- 
ture |2^| to represent the vertices in the standard way and using three data structures 
to maintain the branching, the reverse edges discovered so far, and the "active path" . 
Wh en a cycle arises in G'/S, it must be of the form described in the proof of Lemma 
6.1 and illustrated in Fig. 0. Using these data structures, the algorithm discovers it 
and, if it is long, contracts it in a number of union-find operations proportional to the 
length of the cycle. This yields an 0(ma(m, n))-time algorithm. 

The vertices of G'/S are represented in union-find sets as follows: 
Make-Set(w): Adds the set {v} corresponding to the new vertex of G'/S. 
Find(w): Returns the set in G'/S that contains vertex v. 

Union(u, v): Joins into a single set the two sets corresponding to the vertices in G'/S 
containing G"s vertices u and v. 
The data structures representing the branching, reverse edges, and the active 



paths, respectively are: 

from-root[VF]: For each branching edge (U,W) in G'/S, from-root[W] = (u,w) for 

some (u, w) € (U x W) n E. 
to-root[C/]: For each reverse edge (U,W) in G'/S, to-root[J7] = (u, w) for some 

(u,w) G (U xW)DE. 
to-active [U]: For each vertex U on the "active path" in G'/S, to-active[J7] = (u,w) 

where (u, w) G (U x IF) (1 -E and IF is the child of U for which the recursive 

DFS call is currently executing, unless no recursive DFS is executing, in which 

case to-active[f/] = current. 

For all other vertices, to-active[{7] = nil. 
Pseudo-code for the algorithm is given in Figures |^ and ||. 

Contract-Cycles 3 (G = (V,E)) Pseudo-code. 

1 S^{} 

2 Choose r eV. 

3 DFS(r) 

4 Add 2-cycles remaining in G'/S to S. 

5 return S 
DFS(u) 

1 to-active[FiND(w)] <— current 

2 for each vertex w adjacent to u - traverse edge (u, w) - 



3 if (w is not yet visited) — new vertex — 

4 Make-Set(w) 

5 to-active[FiND(w)] <— from-root[FiND(w)] <— (u,w) 

6 DFS(w) 

7 to-active[FiND(w)] <— current 

8 else - edge creates cycle in G' / S — 

9 if (Find(u) ^ Find(w)) - cycle length at least 2 - 

10 (x, y) <— from-root[FlND(u)] 

11 if (Find(x) = Find(w)) - length two cycle through parent, U — W — U - 

12 to-root[FiND(u)] <— {u,w) — record edge to parent — 

13 else 

14 {x,y) <— from-root[FiND(u;)] 

15 if (Find(cc) 7^ Find(u)) - not a forward edge to child; therefore length of cycle > 3 - 

16 Contract-Cycle(w) 

17 S <- SU{(u,w)} 



18 to-active[FiND(w)] <— nil 

Fig. 4. Practical implementation of C0NTRACT-CYCLES3 

By the preceding discussion, the algorithm implements C0NTRACT-CYCLES3. It 
is straightforward to show that it runs in 0{ma{m,n)) time. Hence, we have the 
following theorem. 

Theorem 6.2. There is an 0{ma{m,n))-time approximation algorithm for the 
minimum SCSS problem achieving a performance guarantee of 1.75 on an m-edge, 
n-vertex graph. Here a(m, n) is the inverse-Ackermann function associated with the 
union-find data structure |22| . 

Example to Illustrate Algorithm.. In the example in Fig. ||, the algorithm begins 
the DFS from vertex 1. It visits vertices 2,3,4 and then traverses the reverse edge (4, 2). 
Since this edge creates a 3-cycle (2, 3), (3, 4), (4, 2) in G'/S it contracts the cycle. Next 
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Contract-Cycle(w) 

1 while (to-active[FiND(w)] ^ current) do 



2 if (to-active[FiND(w)] = nil) then - Go up towards I. c. a. along reverse edges. - 

3 (c,p) <— to-rootfFlND^)] 

4 a <— to-active[FiND(p)] 

5 else — Go down from I. c. a. along active path. — 

6 (p, c) <— to-active[FiND(w)] 

7 a <— to-active[FiND(c)] 

— Contract parent p and child c. — 

8 / <— from- root [Find (p)} 

9 £ <— to- root [Find (p)] 

10 Union(p, c) 

11 to-active[FiND(u;)] <— a 

12 from-root[FlND(w)] <— / 

13 to-root[FiND(w)] <— f 



Fig. 5. Subroutine Contract-Cycle 



it traverses the reverse edge (3, 1), but does not contract it since it forms only a 2- 
cycle in the contracted graph. Continuing the DFS, it visits vertices 5 and 6. When it 
traverses the edge (6, 4), it discovers and contracts the cycle (3, 1), (1, 5), (5, 6), (6, 4). 
Next it visits vertices 7 and 8, traversing the reverse edges (8,7) and (7,6). Travers- 
ing the edge (6,8), it discovers and contracts the 3-cycle (8, 7), (7, 6), (6, 8). In this 
example, no 2-cycles remain, so it returns just the contracted edges. 

7. Potential Improvement of Contract-Cycles^. A natural modification 
to Contract- Cycles ^ would be to stop when the contracted graph has no cycles 
of length more than some c and somehow solve the remaining problem optimally. 



For instance, for c = 3, by following the proof of Theorem 4.1, one can show that 
this would improve the performance guarantee of Contract-Cycles^ to Cfc — 1/36 
(for k > 4), matching the lower bound in Table |l|. (The lower bound given holds for 
the modified algorithm.) 

This leads us to consider the minimum SCSS C problem — the minimum SCSS 
problem restricted to graphs with cycle length bounded by c. The following theorem 
is shown in (jl4j. 

Theorem 7.1. There is a polynomial-time algorithm for the SCSS3 problem. 
We make no conjecture concerning the SCCS4 problem. However, we next show 
that the SCCS 5 problem is NP-hard, and that the SCSSi 7 problem is MAX SNP-hard. 

7.1. NP-hardness of SCSS5. We prove the following theorem. 
Theorem 7.2. The minimum SCSS5 problem is NP-hard. 

Proof. The proof is by a reduction from SAT [|| . Fix an arbitrary CNF formula 
F. We will build a rooted digraph such that any SCSS contains all the edges out of 
the root (d of them) and such that F is satisfiable iff there exists an SCSS E 1 in which 
each of the remaining n — 1 non-root vertices has out-degree equal to one. Thus the 
formula will be satisfiable if and only if there is an SCSS with n — 1 + d edges. 

The graph has a fixed root vertex r and a vertex for each clause in F (these 
vertices are not shown in Fig.. 0). Each clause vertex has a return edge to the root. 
For each variable in F, the graph has an instance of the gadget illustrated in Fig. 0. 
The edges into the gadget come from the root. Each such edge is present in any 
SCSS. The edges out of the gadget are alternately labeled + and — . For every clause 

10 



Graph G 



After traversing edge (3,1) 



After traversing edge (6,4 




After traversing edge (7,6) After traversing edge (6,8) Edges in C 



Fig. 6. Example to illustrate execution of algorithm. 

with a positive instance of the variable, one of the + edges goes to the clause vertex. 
For every clause with a negative instance of the variable, one of the — edges goes to 
the clause vertex. Unassigned + and — edges go to the root. (The gadget is easily 
enlarged to allow any number of occurrences.) 

The key property of the gadget is that if every (non-root) vertex has out-degree 
one in some SCSS, then either all of the counter-clockwise edges are in the SCSS 
(corresponding to the variable being true) or all of the clockwise edges are in the 
SCSS (corresponding to the variable being false). Thus, given any SCSS of d + n — 1 
edges, where d is the outdegree of the root and n is the number of vertices in the 
digraph constructed, it is easy to construct a satisfying assignment for F. Conversely, 
given any satisfying assignment for F, is is easy to construct an SCSS of size d + n — 1. 
□ 

7.2. MAX SNP-hardness of SCSSi 7 . Next we consider the MAX SNP- 
hardness of the problem. The proof uses a reduction from the vertex-cover problem 
in bounded-degree graphs to the SCSS problem. Since the proof follows closely the 
reduction from vertex cover to Hamiltonian circuits (see Q), it is suggested that the 
reader study this reduction before reading this subsection. It is known that the prob- 
lem of finding a minimum vertex cover is MAX SNP-hard in graphs whose maximum 
degree is bounded by seven ^ . 

Let G be a connected, undirected graph whose maximum degree is bounded by 
seven. Let G have m edges and n vertices. We construct a digraph D with 2m + 1 
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vertices and no cycle longer than 17. Any vertex cover of G of size s will yield an 
SCSS of I? of size 2m + s, and vice versa. We then show that, since G has 0(n) edges, 
this yields an L-reduction (i.e., an approximation-preserving reduction fl9|| ). 

7.2.1. The construction of D. Applying Vizing's theorem |23], color the edges 
of G in polynomial time with at most eight colors so that no two edges incident to a 
vertex share the same color. Let the colors of the edges be one of {1,2,..., 8}. 

The construction begins with a special "root" vertex r in D with outgoing arcs 
to each of n vertices v\, . . . , v n , corresponding to the vertices of G. An arc from r to 
Vi is labeled vf . Each of these vertices form the beginning of a path corresponding 
to each vertex of G. 




V 

Fig. 8. A cover-testing component. 



As the construction proceeds, each vertex in G will have a "current vertex" in 
D, initially the start vertex. For each edge (u,v), in order of non-decreasing color, 
add a "cover-testing gadget" to D, as illustrated in Fig. [|. Specifically, add two new 
vertices x and y. Add two edges into x: the first, labeled u + , from the current vertex 
of it; the second, labeled u~ , from y. Similarly, add two edges into y: the first, labeled 
v + , from the current vertex of u; the second, labeled u - , from x. Make y the new 
current vertex of it; make x the new current vertex of v. Finally, after all edges of G 
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have been considered, for each vertex v in G, add an edge labeled v + from its final 
current vertex to the root. The gadgets are implicitly layered, with each gadget being 
assigned to a layer corresponding to the color of the associated edge in G. 

Lemma 7.3. The graph D constructed above has no cycle with more than 17 
edges. 

Proof. We first assign numbers to the vertices of D. The root r is assigned the 
number 0. The construction above proceeds in the order of increasing color of the 
edges of G. When considering an edge (u, v) of color c, we add two new vertices: x 
is added to v's path and y is added to u's path. We assign the vertices x and y the 
number c. Consider any cycle X of length greater than two in D. It is clear that 
such a cycle must pass through r, since D is layered. Hence the cycle is of the form 
(r, Xi, X2, ■ ■ ■ ,Xk, r). Because we considered the edges in order of increasing color, the 
numbers assigned to the vertices in X increases at least every two steps in any path in 
D (not including r). In other words, the numbers assigned to the vertices Xi,. . . ,Xk 
forms a non-decreasing sequence in which no three consecutive vertices get the same 
number. Since the edges of G were colored with 8 colors, the numbers assigned to the 
vertices of D range from to 8 (only r gets the number 0). Combining all these, the 
length of the cycle X is at most 17. □ 

7.2.2. The analysis. We now show that every vertex cover of G has a corre- 
sponding SCSS in D. The proof is similar to the corresponding proof (in the reduction 
from vertex cover to Hamiltonian circuits) that every vertex cover has a corresponding 
Hamiltonian circuit. Consider an arbitrary vertex cover S of G. The idea is to choose 
in the SCSS the paths corresponding to S in D. The paths of the vertices of V — S 
are yet to be connected. Since S forms a vertex cover, the vertices in the paths of 
V — S can be connected using the cover-testing components. 

Lemma 7.4. Given a vertex cover of size s of G, an SCSS of D of size 2m + s 
can be constructed. 

Proof. Construct a subgraph H of D as follows. For each vertex u in G, let d be 
the degree of u in G. If u is in the vertex cover, add the d + 1 edges labeled u + in D 
to H. Otherwise, add the d edges labeled u~ in D to H . It is easy to verify that H 
has the following properties: 

1. H has 2m + s edges. 

2. H has no cycles of length 2. 

3. Every vertex of H has at least one outgoing and at least one incoming edge. 
As mentioned earlier, D is layered and every cycle of length greater than 2 contains 
r. Therefore Property 2 above implies that every cycle of H passes through r. By the 
above conditions, H contains a path from r to every vertex v and another path from 
v to r, and is therefore strongly connected. To obtain a path from any v to r, start 
from v and keep traversing an outgoing edge (which exists by Property 3) from the 
current vertex. Such a path must eventually reach r because r is contained in every 
cycle of H. Hence H satisfies the lemma. □ 

We now show that every SCSS of D corresponds to a vertex cover of G. The proof 
works by showing that any SCSS can be converted into a "canonical" SCSS whose 
size is no larger, that corresponds to a vertex cover of G. 

Lemma 7.5. Given an SCSS in D of size 2m + s, a vertex cover of G of size s 
can be constructed. 

Proof. First, as long as some non-root vertex y has both of its incoming edges in 
the SCSS, modify the SCSS as follows. Let (x,y) be the edge labeled v~ for some v. 
Remove the edge (x,y) and add the other edge out of x, if it is not already present. 
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Alternatively, if some non-root vertex x has both of its outgoing edges in the SCSS, 
remove the edge (x, y) and add the other edge into y. Repeat either modification as 
long as applicable. 

By the layering of D, each modification maintains the strong connectivity of the 
SCSS. Clearly none of the modifications increases the size. Each step reduces the 
number of edges labeled u~ for some u in the SCSS, so after at most 2m steps, 
neither modification applies, and in the resulting SCSS every non-root vertex has 
exactly one incoming edge and one outgoing edge in the SCSS. 

An easy induction on the layering shows that for any vertex v in G, either all of 
the edges labeled v + in D are in the SCSS or none are, in which case all of the edges 
labeled v~ are in the SCSS. Let C be the set of vertices in G of the former kind. It is 
easy to show that the size of the SCSS is 2m + \C\, so that |C| < s. For every edge 
(u, v) in G, the form of the gadget ensures that at least one of the two endpoints is 
in C. Hence, C is the desired cover. □ 

Theorem 7.6. The minimum SCSSrj problem is MAX SNP-hard. 

Proof. Let G be an arbitrary undirected graph G whose maximum degree is 
bounded by seven. Let G have m edges and n vertices. Construct the digraph D 
as shown earlier. By Lemma 7.3, D has no cycles greater than 17. By Lemma 7.4, 
any vertex cover of G of size s can be used to obtain an SCSS of D of size 2m + s. 



Conversely, by Lemma 7.5, an SCSS of D of size 2m + s can be used to obtain a 
vertex cover of G of size s. Since the degree of G is bounded, m = 0(n) = O(s) and 
it is easily verified that this yields an L-reduction from degree-bounded vertex cover 
to the minimum SCSS17 problem. □ 



8. Open Problems. An obvious problem is to further characterize the various 
complexities of the minimum SCSS& problems. 

The most interesting open problem is to obtain a performance guarantee that is 
less than 2 for the weighted strong connectivity problem (as mentioned earlier, the 
performance factor of 2 is due to Frederickson and JaJa || ) . Such an algorithm may 
have implications for the weighted 2-connectivity problem jl5| in undirected graphs 
as well. 

The performance guarantee of /c-Exchange probably improves as k increases. 
Proving this would be interesting — similar "local improvement" algorithms are ap- 
plicable to a wide variety of problems. 
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